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Abstract. Let G be a group given by the presentation 

(ai, . . . ,a k ,bi, . . .b k \ai = «j(6), 6, = Vi(a) for 1 < i < k), 

where k > 2 and where the u; S F(6i, . . . , &&) and £ F(ai, . . . ,a k ) are 
random words. Generically such a group is a small cancellation group and it 
is clear that (ai, . . . , aj;) and (&i, . . . , b k ) are generating n-tuplcs for G. We 
prove that for generic choices of «i, . . . , Uj, and v\, . . . ,v k the "once-stabilized" 
tuples (ai, . . . , 1) and (6i, . . . , b k , 1) are not Nielsen equivalent in G. This 
^ ■ provides a counter-example for a Wiegold-type conjecture in the setting of 

word-hyperbolic groups. We conjecture that in the above construction at least 
k stabilizations are needed to make the tuples (ai, . . . , a k ) and (bi, . . . , b k ) 
Nielsen equivalent. 



1. Introduction 

For a group G and an integer k > 1 , Nielsen equivalence is an equivalence relation 
on the set of ordered fc-tuples of elements of G. Let T = (<7i, • ■ • , G G k and 
T' = (g[, . . . , g' k ) G G k be two fc-tuples. Then 7~ and T 7 are elementary equivalent 
(write T ~ e 7"') if one of the following holds: 

(1) There exists some a G Sk such that g\ = g a u\ for 1 < i < k. 

(2) g[ = g^ 1 for some i = 1, . . . , k and g'j = gj for j =/= i. 

(3) g[ = gigj for some i ^ j and g' t = gt for t ^ i. 

The above transformations are called a Nielsen transformations or Nielsen moves. 
Two tuples 7" and T' are called Nielsen equivalent or simply equivalent if there exists 
some finite sequence 7o, 71, ■ • • ,77 such that 

T = 7o 71 ~ e • • . ^ e 7/ = 7" . 

Jacob Nielsen introduced this equivalence relation to study subgroups of free 
groups. Among other things, he showed that in the free group F n = F{x\, . . . , x n ) 
any generating fc-tuple is Nielsen equivalent to the tuple (xi, . . . , x n , 1, . . . , 1) (so 
that, necessarily k > n). In particular any two generating fc-tuples of F n are 
Nielsen equivalent. This result, together with the fact that any Nielsen move on 
a basis of F n induces an automorphism of F n , implies the following alternative 
characterization of Nielsen equivalence: 

Two fc-tuples T = (gi,...,g k ) & G k and V = (g[,...,g' k ) G G k are Nielsen 
equivalent if and only if there exists a homomorphism cf> : Fk — > G and an automor- 
phism a of Fk such that the following hold: 

(1) gi = <fi(xi) for 1 < i < k. 

(2) g[ = <j)oa(xi) for 1 < i < k. 

Let Fk = F(x\ , . . . , Xk) be a free group of rank fc with a fixed free basis (x\ , . . . , Xk). 
There is a natural identification between the set Hom(Fk,G) of homomorphisms 
from Fk to G and the set G k of fc-tuples of elements of G. There is also a natural 
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left action of Aut(Fk) on Hom(Fk,G) by pre-composition. In view of the above 
remark, two fc-tuples of elements of G are Nielsen equivalent if and only if the 
corresponding elements of Hom(Fk, G) lie in the same Aut(Fk)-orbit. 

In general it is difficult to decide if two fc-tuples are Nielsen equivalent in a given 
group. 

If two tuples T = (gi,...,gk) € G k and T' = (g[ , . ■ ■ , g' k ) S G k are Nielsen equiv- 
alent then they generate the same subgroup of G, that is (T) = (T'} < G. Thus 
if two tuples generate different subgroups of G, the tuples are not Nielsen equiva- 
lent. However, this observation does not help in distinguishing Nielsen equivalence 
classes of tuples generating the same subgroup, in particular those tuples that gen- 
erate the entire group G (generating tuples). Under the identification of the set of 
fc-tuples in G with Hom(Fk,G) discussed above, the set of generating fc-tuples of 
G corresponds to the set Epi(F k , G) of epimorphisms from F k to G. 

The only exception is the case k = 2. A basic fact due to Nielsen shows that if 
2-tuples (31,(72) and (/ii,/i2) are Nielsen equivalent in G then [31,32] is conjugate 
to [hi, h2\ ±1 in G. No such criteria exist for fc > 3 and there are few known results 
distinguishing Nielsen-equivalence classes of generating fc-tuples for fc > 3. 

Note that even in the algorithmically nice setting of torsion-free word-hyperbolic 
groups the problem of deciding if two tuples are Nielsen equivalent is algorithmically 
undccidablc. Indeed the subgroup membership problem is a special case of this 
problem since two tuples (31, . . . ,g n , h) and (31, . . . , g n , 1) are Nielsen equivalent if 
and only if h G (31, . . . , g n ). This implies in particular that Nielsen equivalence is 
not decidable for finitely presented torsion-free small cancellation groups since they 
do not have decidable subgroup membership problem as shown by Rips [R] . 

Understanding Nielsen equivalence of generating fc-tuples for fc > 3 is particularly 
difficult, and the problem becomes even harder if fc > rank(G), where rank(G) is 
the smallest size of a generating set of G. 

Of particular interest here is the so-called Wiegold conjecture about generating 
tuples of finite simple groups. A generating fc-tuple for G is redundant if it con- 
tains a proper subtuple that still generates G. We say that a generating tuple 
is weakly redundant if it is Nielsen equivalent to a redundant tuple. Note that a 
redundant tuple (31, . . . , gk) is always Nielsen equivalent to a fc-tuple of the form 
(hi, . . . ,hk—i, 1)- Thus a generating tuple is weakly redundant if and only if it 
is Nielsen equivalent to a tuple containing a trivial entry. It is well-known, as a 
consequence of the classification of finite simple groups, that every finite simple 
group G is two-generated, so that rank(G) < 2. The Wiegold conjecture says that 
if G is a finite simple group and fc > 3 then any two generating fc-tuples of G are 
Nielsen equivalent; in other words the conjecture says that the action of Aut(Fk) 
on Epi(Fk, G) is transitive in this case. Since G is two-generated and has a gener- 
ating fc-tuple of the form (a, b, 1, ... , 1), this implies that any generating fc-tuple of 
G is redundant. The Wiegold conjecture is closely related to the so-called "prod- 
uct replacement algorithm" for finite groups and there is substantial experimental 
evidence and a number of partial theoretical results in favor of the validity of the 
Wiegold conjecture. We refer the reader to |Pa[ ILGM1 ILP| for a more extensive 
discussion of this topic. 

One obvious way of producing redundant tuples is via the so-called "stabiliza- 
tion" moves. A stabilization move on a fc-tuple (31, . . . , gk) gives a (fc + l)-tuple 
(31, . . . ,3/j, 1). It is easy to see that for any generating fc-tuples (31, . . . ,gu) and 
(hi,...,hk) of a group G, applying fc stabilization moves to each of them pro- 
duces Nielsen equivalent 2fc-tuples (31, . . . ,gk, 1, • • • , 1) and (hi,..., hk, 1, . . . , 1). 
The Wiegold conjecture implies that for any two generating pairs (a, b) and (01, 61) 
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of a finite simple group G, the once-stabilized tuples (a, 6, 1) and (oi,6i,l) are 
Nielsen equivalent. 

Let us mention here the (few) known results on distinguishing Nielsen equivalence 
of generating /c-tuplcs for infinite groups. Apart from the special and much easier 
case of k = 2, these can be mostly separated into two distinct approaches. 

The first one is algebraic ( A"-theoretic) . The earliest work is due to Noskov |No] 
who showed that there exist non-minimal generating tuples that are not Nielsen 
equivalent to a tuple containing the trivial element and thereby giving a negative 
answer to a question of Waldhausen. These results where later generalized by Evans 

m- 

Lustig and Moriah [LM1] . |LM2j . |LM3j used algebraic methods to distinguish 
Nielsen equivalence classes of Fuchsian groups and other groups with appropriate 
presentations. This enabled them to distinguish isotopy classes of vertical Heegaard 
splittings of Seifert manifolds. 

Recently Evans |E21 IE3J . for any given number N, produced large generating 
tuples of mctabelian groups that do not become Nielsen equivalent after adding the 
trivial element to the tuples N times, making those the first examples of this type 
in the literature even for the case N = 1. The generating tuples however are much 
bigger than the rank of the group. 

The second approach is combinatorial-geometric and is closer in spirit to Nielsen's 
original work, it relies mostly on using cancellation methods. First in line is 
Grushko's theorem [G] which states that any generating tuple of a free product 
is Nielsen equivalent to a tuple of elements that lie in the union of the factors. To- 
gether with recent work of the second author [Wj this implies that Nielsen equiva- 
lence of irreducible generating tuples in a free product is decidablc iff it is decidablc 
in the factors. 

Zieschang [Z] proves that any minimal generating tuple of a surface group is 
Nielsen equivalent to the standard generating tuple and proves a similar result 
for Fuchsian groups that lead to the solution of the rank problem |PRZ| . Nielsen 
equivalence in Fuchsian groups has been studied by many authors. Recently Louder 
[L] has generalized Zicschangs result to arbitrary generating tuples. 

The finitencss of Nielsen equivalence classes of fc-tuples for torsion-free locally 
quasiconvcx-hypcrbolic groups has been established by the authors |KWj general- 
izing a result of Delzant [D] who studied 2-generated groups. The first author and 
Schupp [KS] have recently established uniqueness of the Nielsen equivalence class 
of minimal generating tuples for a class of groups closely related to the one studied 
in the present article. 

The main result of this paper is the following theorem, which implies in particular 
that there exist 2-generated torsion-free word-hyperbolic groups that have gener- 
ating pairs (01,02) and (61,62) such that (oi,02)l) and (61,62,1) are not Nielsen 
equivalent. See Section [3] for precise definitions and notations related to genericity. 
If k > 2, m > 2 are integers, for the free group F(a\, . . . , a^) we denote by C m ^A the 
set of all m-tuples {v\, . . . , v m ) of cyclically reduced words in F(ai, . . . , a^) such 
that \v\\ = ■ ■ • = \v m \. Similarly, denote by Ck.B the set of all /c-tuples (ui, . . . ,Uk) 
of cyclically reduced words in F(b\, . . . , bk) such that |wi| = • • • = \uk\- 

Theorem 1.1. Let k > 2,m > 2 be arbitrary and let G be a group given by the 
presentation 

(f) (01, . . . , Ofe, 61, . . . b m I di = Ui{b), bj = Vj(a) for 1 < i < k, 1 < j < m). 
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There exist generic subsets Uk Q Ck.B and V m C C m .A such that for any 
(u\, . . . , Uk) G Uh and any (v\, . . . , v rn ) G V m with \u±\ = \vi\, the group G given by 
presentation (f ) above has the following property: 

For any <?i, . . . ,gk—i G G the generating (k + 1) -tuple (ai, . . . ,afe, 1) of G is not 
Nielsen equivalent to a tuple (pi, &2j5ii • • • j 9k-i)- 

The theorem immediately implies that for the case k = m the generating (fc + 1)- 
tuples (ai, . . . , <Xfc, 1) and (&i, 1) are not Nielsen equivalent in G. However, 

we do believe that much more is true, namely that following holds: 

Conjecture 1.2. Let fc > m > 2. There exist generic subsets Uk Q Ck.B and 
Vm £ C m .A such that for any (u\, . . . , uu) G Uk and any (vi, . . . , v m ) G V m wii/i 
|ui| = i/ie group G given by presentation (t) above has the following property: 
For any 1 < t < m the generating (k + t) -tuple (ai, . . . , a,k, 1, ■ • ■ , 1) is not 
equivalent to a (k + t) -tuple of type (b\, . . . , bt+i,gi, ■ ■ • , <7fc-i)- 

In particular, if fc = m, the conjecture would imply that at least k stabilizations 
are needed in G in order to make the generating fc-tuplcs (a\, . . . , <Zfc) and (&i, . . . , bk) 
Nielsen equivalent. 

The conclusion of Theorem 11,11 implies that for G as in the assumption of the 
theorem with generic Ui and Vj, the generating fc-tuples (oi, . . . , au) and 
(bi, b%, . . . , 6 m , 1, . . . , 1) (where (k — m) > trivial entries are present in the second 
tuple) are not Nielsen equivalent in G. While the proof of Theorem 11.11 is very 
complicated, we also give a simple proof (see Theorem I4.4[) that in this situation 
(oi, . . . , a*) is not Nielsen equivalent to a fc-tuple of the form ,*,... , *). 

Although we do not prove it in this paper, for the case k = m > 2 and G 
as in Theorem 11.11 one can use the methods of |KSj to show that G has ex- 
actly two Nielsen-equivalence classes of generating fc-tuplcs, namely [a\, . . . ,a,k) 
and (bi,...,b k ). 

We are grateful to the referee for useful comments. 

2. Small cancellation theory 

Recall that a set R of cyclically reduced words in F = F(a\, . . . , afc) is sym- 
metrized if for every r G R all cyclic permutations of r^ 1 belong to i?. For a 
symmetrized set R C F(a\, . . . , afc), a freely reduced word u G F(oi, . . . , a^) is a 
piece with respect to R if there exist n , T2 G i?, such that n 7^ ^2 and such that u 
is an initial segment of each of r±, r%. 

Definition 2.1 (Small Cancellation Condition). Let R C F(ai, . . . , afc) be a sym- 
metrized set of cyclically reduced words. Let < A < 1. We say that R satisfies 
the C' (X) -small cancellation condition or that R is a C'(X)-set if, whenever v is a 
piece with respect to R and v is a subword of some r G R, then \v\ < X\r\. 

We say that a presentation G = (ai, . . . ,ak\R) satisfies the C'(X) -small cancel- 
lation condition if R C F(a\, . . . , a^) is a G'(A) set. 

The following fact is a well-known basic property of small cancellation groups |LSj : 

Proposition 2.2. Let G = (a\, . . . ,ak\R) be a C' (A) -presentation, where X < 1/6. 
Let w G F(a\, . . . ,a,k) be a nontrivial freely reduced word such that w =g 1. Then 
w has a subword u such that for some r G R the word u is a subword of r satisfying 
\u\ > (1 - 3A)|r|. 

Definition 2.3. Let G = (ai, . . . , ak\R) be a C"(A)-presentation, where A < 1/100. 
For a freely reduced word w G F(a±, . . . , a^) we say that w is Dehn reduced with 
respect to R if w does not contain a subword u such that u is also a subword of 
some r G R with |u| > |r|/2. 
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We say that a freely reduced word w € F(a\, . . . , flfe) is X-reduced with respect to 
R if w does not contain a subword u such that u is also a subword of some r G i? 
with |u| > (1 - 3A)|r|. 

Similarly, we say that a cyclically reduced word w € F(ai, . . . , ak) is X-cyclically 
reduced with respect to R if every cyclic permutation of w is A-rcduccd with respect 
to R. We also say that a cyclically reduced word w € F(a±, . . . , ak) is cyclically 
Dehn-reduced with respect to R if every cyclic permutation of w is Dehn-reduced 
with respect to R. 

Note that if A < 1/6 then any Dehn-reduced word is A-reduced. Proposition ^. 21 
says that for a C"(A)-presentation with A < 1/6, any nontrivial freely reduced and 
A-reduced word in F(a\, . . . , ak) represents a nontrivial element of G. 

The following statement follows from the basic results of small cancellation the- 
ory, established in Ch. V, Sections 3-5 of |LS] (see also [Str] ). 

Proposition 2.4. [Equality and Conjugacy diagrams in C (X) -groups] 
Let 

(*) G = {a 1 ,...,a k \R) 

be a C (X) -presentation, where X < 1/100. 

(1) Let wi,W2 S F(ai, . . . , ak) be freely reduced and X-reduced words such that 
wi =g U>2- Then any reduced equality diagram D over (*), realizing the 
equality ui\ =q w%, has the form as shown in Figured Specifically, any 
region Q of D labeled by r G R intersects both the upper boundary of D 
(labeled by ui\) and the lower boundary of D (labeled by Wi) in simple 
segments OL\,ct2 accordingly, satisfying 

A|r|<K-|<(l-3A)|r|. 

Moreover, if two regions Q, Q' in D, labeled by r, r' € R, have a common 
edge, then they intersect in closed simple segment 7 joining a point of the 
upper boundary of D with a point of the lower boundary of D and labeled by 
a piece with respect to R. In particular \j\ < X\r\ and |"y| < A|r'|. Further, 
if both w\ and w% are also Dehn-reduced, then for the segments ctj above 
we have \a 3 \ > (1/2 - 2A)|r| > |r|/3. 




w 2 

FIGURE 1 . Equality diagram in a small cancellation group 

(2) Let ui\,W2 € F(a\, . . . , ak) be nontrivial cyclically reduced and X-cyclically 
reduced words representing conjugate elements of G. Then there exists a 
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reduced conjugacy diagram D over (*) with the inner cycle boundary la- 
beled by a cyclic permutation of W2 and the outer cycle boundary labeled 
by a cyclic permutation of w± , of the form shown in Figure Specifically, 
any region Q of D labeled by r E R intersects both the outer boundary of 
D (labeled by a cyclic permutation of w\ ) and the inner boundary of D (la- 
beled by a cyclic permutation of W2) in simple segments U\,ct2 accordingly, 
satisfying 

A|r| < \a>j\ < (1 — 3A)|r|. 

Moreover, if two regions Q,Q' in D, labeled by r,r' £ R, have a common 
edge, then they intersect in closed simple segment 7 joining a point of the 
inner boundary of D with a point of the outer boundary of D and labeled by 
a piece with respect to R. In particular \~f\ < X\r\ and \ j\ < X\r'\. Further, 
if both ui\ and u>2 are also Dehn-reduced, then for the segments ctj above 
we have \a,\ > (1/2 - 2A)|r| > |r|/3. 




Figure 2. Conjugacy diagram in a small cancellation group 



Remark 2.5. Suppose that w\ and W2 are as in part (1) of Proposition 12.41 and 
suppose that W\ and W2 are Dehn-reduced (which is a stronger assumption than 
being A-reduced). Let Q be a region of D intersecting the upper and the lower 
boundaries of D in segments ot\ and 012- Let r £ R be the label of a boundary cycle 
of Q. Since the overlaps of Q with the neighboring regions in D have length at 
most A|r| each and since \ctj\ < r|/2 by Dehn-reduceness of w\ and W2, it follows 
that 

\aj\ > \r\ - 2X\r\ - \r\/2 = |r|(l/2 - 2A) > \r\^- > \r\/3, 
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since A < 1/100. Thus in this case 

|r|/3 < otj < \r\/2. 

Similar conclusions apply to case (2) of Proposition 12.41 if we assume Wi and Wi to 
be cyclically Dehn reduced there. 

Corollary 2.6. Let G = (a\, . . . ,ak\R) be a C"(A) presentation where X < 1/100. 
Suppose that for every r £ R we have \r\ > 2/A + 1. Then: 

(1) For i ^ j the elements af 1 and a^ 1 are not conjugate in G, and ai is not 
conjugate to a" 1 in G. 

(2) Suppose that wherever a? is a subword of some r G R, where 1 < s < k, 
then \p\ < X\r\. Then for any i ^ j and any m^0,n^0, the elements a™ 1 
and a™ are not conjugate in G. 

(3) Suppose that w is a cyclically reduced word in F(A) that is conjugate in G 
to a±. Then either w — a± in F(A) or w is not cyclically X-reduced. 

Lemma 2.7. Let G = (a±, . . . ,ak\R) be a C"(A) presentation where X < 1/100. Let 

da be the word-metric on G corresponding to the generating set {ai, . . . ,ak}- Let 
g G G, g 5^ 1 be arbitrary. Then there exist freely reduced words u, v G F(a±, . . . , afe) 
such that: 

(1) We have g =g vuv . 

(2) The word u is cyclically reduced and cyclically Dehn-reduced. Moreover, u 
is dc-geodesic and the element of G represented by u is of shortest possible 
length among all elements conjugate to g in G. 

(3) The word v is Dehn-reduced. 

(4) The word vuv is freely reduced, as written, and is X-reduced. 

(5) If z is any X-reduced word in F(a\, . . . , a^) representing g, then either z = 
vuv~ 1 in F(a\, . . . , Ofc) or there is some r G R such that both z and vuv -1 
contain subwords of r of length > X\r\. 

Proof. Consider all representations of g as g = hgoh -1 where go € G is shortest in 
the conjugacy class of g. Once go is fixed, among all such representations of g as 
g = hgoh~ l , choose one, g = hgoh -1 , where h G G is the shortest possible. Let 
u be a G-geodesic representative of go and let v be a G-geodesic representative of 
h. Note that the minimality in the choice of go implies that u is cyclically reduced 
and cyclically Dehn-reduced. Also, the minimality in the choice of h implies that 
the word vuv -1 is freely reduced as written. Since v is a geodesic word, it is Dehn- 
reduced. We claim that the word vuv -1 is A-reduced. Indeed, suppose not. Then 
vuv -1 contains a subword w such that w is also a subword of some r G R with 
M>(l-3A)|r|. 

Note that since u and v are Dehn-reduced, the subword w overlaps at least two 
of the subwords v, u, i> -1 in vuv~ 1 . 

Case 1. The word w is a subword of vu or of uv . We assume that w is 
a subword of vu, as the other case is symmetric. Then w = W1W2, where u^i 
is a terminal segment of v and W2 is an initial segment of u. We write v and 
u as v = v'w\ and u — w-2u' . Let y G ^(^4) be such that r = wy = w\W2y 
in F(A), so that |y| < 3A|r|. Note that since u and v are Dehn-reduced and 
\w\ > (1 - 3A)|r|, it follows that K|, \w 2 \ > (| - 3A)|r| > \r\/3 > 3A|r|. Observe 
also that v = v'w\ =g v'y^w^ 1 ■ Therefore 

g = G vuv^ 1 = (w'wi)(w2u')( u, r 1 ( w ') _1 ) =G 
=G v'y~ 1 w 2 ~ 1 W2u'w 2 y(v'y 1 = (v'y~ 1 )(u / w 2 )(y(v'y 1 ). 

Since \y\ < 3A|r| and |u>i| > 3A|r|, it follows that < \v\, contradicting the 

minimality in the choice of h. 
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Case 2. The subword w overlaps both v and w _1 in mi;' 1 

If the overlap of w with one of v, w _1 has length < A|r|, then either vu or uv^ 1 

contains a subword of r of length > (1 — 5A)|r|, and we get a contradiction similarly 

to Case 1. 

If the overlaps of w with each of v, v^ 1 have length > A|r|, we get a contradiction 
with the C"(A)-small cancellation condition. 

Thus part (1), (2), (3) and (4) of the lemma are established. Since the words 
too -1 and z are both A-reduced, part (5) of the lemma now follows from Proposi- 
tion E31 □ 



3. Genericity 

In this paper we work with the Arzhantseva-CTshanskii model of genericity in 
free groups (see |AQ[[KM SS. KSj) based on the asymptotic density considerations. 

Convention 3.1. Let F = F(A), where A = {oi, . . . , a^} and k > 2. Let m > 1 

and let U C F m be a subset of F m . For n > we denote by ja {n, U) the number 
of all m-tuples (tii, ■ • ■ j Km) £ U such that \ui\a = n for i = 1, . . . , m. Note that for 
n > 1 we have j A (n, F m ) = (2fc(2fc - l)"" 1 )" 1 . We say that U C F m is spherically 
homogeneous if for every (ui, . . . , u m ) S C/ we have \ui\a = • • • = |itm|A- Note that 
this restriction is vacuous if m = 1. Let SVn = S m ,A denote the set of all m-tuples 
(iti, . . . ,u m ) £ i 71 " 1 such that \u\\a = • • • = |"mU- 

Let C = Ca be the set of all cyclically reduced words in F(A). Let C m = C mi A = 
C m n Thus C m consists of all m-tuples (ui, . . . , u m ) of cyclically reduced words 
in F(A) such that = • • • = |it m |. 

Although the notion of genericity make sense for arbitrary subsets of F m , for 
reasons of simplicity we will restrict ourselves to spherically homogeneous subsets 
in this paper. Moreover, in applications we will only be concerned with tuples of 
cyclically reduced words. 

Definition 3.2. Let F = F(A), where A = {a 1: ...,a k } and k > 2. Let m > 1. 

Let fl C S m be a spherically homogeneous subset. Let U C fl. 

(1) We say that U is generic in if 

Um 7A( /^ = 1. 
n->oo 7yi(n, si) 

If, in addition, the convergence to 1 in the above limit is exponentially fast, 
we say that U is exponentially generic in il. 

(2) We say that a subset U C Q is negligible in Q (correspondingly exponentially 
negligible in Q) if fi \ U is generic (correspondingly exponentially generic) 
in il. 

We stress that the above notions of genericity and negligibility arc highly depen- 
dent on the choice of a free basis A of F. Therefore in all our discussions regarding 
genericity such a free basis is assumed to be fixed. 

The following is a straightforward corollary of the definitions: 

Lemma 3.3. Let U C C m . Then the following are equivalent: 
(1) The set U is exponentially negligible in C m . 



(2) We have 

with exponentially fast convergence. 



lim , lA{n > U) = 0, 
^oo (2k - l) nm 
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(3) We have 

limsup 1057 ^'^ <log(2fc-l). 

n—too TlTtl 

Similarly, the definitions imply: 

Lemma 3.4. Let m > 1. 

(1) The union of a finite number of (exponentially) negligible subsets of C m is 
(exponentially) negligible in C m . 

(2) The intersection of a finite number of (exponentially) generic subsets of C m 
is (exponentially) generic inC m . 

(3) Let U C C be an exponentially generic subset. Then U m l~l S m is exponen- 
tially generic in C m . 

Convention 3.5. We say that a certain property of ro-tuples of cyclically reduced 
elements of F = F(A) is generic (correspondingly, exponentially generic) if there 
exists a generic (correspondingly, exponentially generic) subset U C C m such that 
every m-tuplc in U has the property in question. 

We list some properties of m-tuples of cyclically reduced elements of F that are 
known to be exponentially generic (see |AQ1 IKS1 IKKSj L 

Proposition 3.6. Let F = F(A), where A = {a 1: . . . , ak} and k > 2. Let m > 1. 

Then: 

(1) The property that no element of an m-tuple is a proper power in F is 
exponentially generic in C m . 

(2) Let < A < 1 be arbitrary. Then the property that an m-tuple {u\, . . . , u m ), 
after cyclic reduction and symmetrization, satisfies the C"(A) small cancel- 
lation condition, is exponentially generic inC m . 

(3) The property that for an m-tuple (u\, . . . ,u m ) for every i ^ j the element 
Ui is not conjugate to u^ 1 in F, is exponentially generic in C m . 

(4) Let K > 1 be an integer and let < A < 1. Then the property of an m-tuple 

. . . , u m ) that every subword u of some ui of length > X\ui\ contains as 
a subword every freely reduced word of length < K in F(A), is exponentially 
generic in C m . 



4. STALLINGS FOLDS AND NIELSEN EQUIVALENCE 

In this section we review Stallings folds and prove a weak version of our main 
theorem. The proof of this special case is quite easy but still illustrates some of the 
ingredients of the proof of the general case. 

In a beautiful article Sta Stallings used graphs to represent subgroups of free 
groups and described how to use simple operations called folds to transform the 
graph into a graph at which a basis of the subgroup can be read off. We briefly 
discuss Stallings folds for the free group F = F(a\, . . . , au) and refer the reader to 
[KMj for more detailed background information. 

Put A = {ai, . . . , afe}. Let Ra be the directed labeled graph consisting of a single 
vertex vq and k loop-edges e\, . . . , with labels oi, . . . , afe, respectively. There is 
an obvious isomorphism 

(j> : tti(R a ,v ) F 

that maps the homotopy class represented by the loop edge e^, travelled in positive 
direction, to a,. 
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Figure 3. The graph Ra 

An A-graph is a directed labeled graph V with edge-labels from A. For any 
A- graph T there exists a unique label-preserving graph- map p : T — > Ra- Af- 
ter choosing a base- vertex x of T, the morphism p induces a homomorphism p* : 
7Ti(r, a;) -> 7Tl(i2A,Vo)- 

Given a tuple T = (jg\, . . . ,g s ) of elements from tti(Ra, vq) = F we can construct 
a graph St with base vertex xq such that p*(7ti(iSt,xo)) = (ffi) ■ * ■ as f°ll° ws - 
We assume that ^ ^ 1 for 1 < i < I < s and = 1 for I < i < s. 

(1) St is a wedge of I circles with base vertex xo where the «th circle is of 
simplicial length \gi\A- 

(2) For i = 1, . . . , I, the label of the i-th circle is the reduced word representing 
9i- 




Figure 4. The map p from St to Ra 

Stallings observed that any such A-graph St (and, more generally, any finite 
connected A-graph) can be modified by folds without changing the image of the 
induced homomorphism so that the resulting graph is folded, i.e. the associated 
morphism p is an immersion. A fold here is the identification of two edges that 
have the same label and the same initial vertex or the same terminal vertex. Note 
that one or both edges can be loop-edges. 

If T is a folded A-graph, then p : T —> Ra maps reduced edge-paths to reduced 
paths which implies that is injective. 

Lemma 4.1. Let T be a finite connected A-graph and Xq be a vertex of T such 
that p* : 7Ti(r,xo) t{i(Ra,vq) is surjective. Then there exists a finite sequence 
of A- graphs 

r = To, Ti, . . . , T n = Ra 
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Figure 5. A fold identifying two edges labeled a.i 

such that Ti can be obtained from by a fold for 1 < % < n. 

Proof. Note that each fold reduces the number of edges in a finite A-graph. Thus 
any sequence of folds, that starts with the graph T, must terminate. Let T n be 
obtained from T by a maximal sequence of folds, so that T n is folded. We claim 
that r„ = Ra- 

As r„ is folded, it follows that is injective for any choice of base point in 
r„. Let Xq the vertex of r ra be the image of xq under the folds. By assumption 
p* : 7Ti(r n ,Xg) — > tti(Ra,vq) is surjective. In particular, for each a, G A, there 
must exist a loop edge at Xq labeled with a*, as the homotopy class represented 
by Bi must be the image of some reduced path in T n . The assumption that T n is 
folded now easily implies that r n = Ra- □ 

Definition 4.2. We say that a finite connected graph T is a core graph, if it 
does not have any degree-one vertices. For a finite connected graph T with a 
nontrivial fundamental group let Core(T) be the unique smallest subgraph of T 
whose inclusion into L is a homotopy equivalence. 

Thus Core(F) is obtained from T by cutting off a (possibly empty) collection of 
tree-branches. Note that CoreiT) is a core graph. We need the following simple 
observation about ^4-graphs that fold onto Ra with a single fold. 

Let L be an A-graph. We say that an A-graph L' is L with a spike-edge, if V is 
obtained from T by adding an edge e such that the initial vertex lies in T and the 
terminal vertex does not. 

Lemma 4.3. Let T be a finite core A-graph of rank k. 

(1) If T folds onto Ra with a single fold then there exists a reduced word of 
length 2 that cannot be read as the label of an edge-path in Y . 

(2) If T folds with a single fold onto Ra with a spike-edge, then there exists a 
reduced word of length 4 that cannot be read as the label of an edge-path in 

r. 

Proof. (1) Note that the fold must identify a loop edge with a non-loop edge since 
identifying two loop edges decreases the rank and identifying two non-loop edges 
yields a graph with a non-loop edge. Thus w.l.o.g. we can assume that T has two 
distinct vertices x and y and the fold identifies a loop edge at x and an edge from 
x to y both with label a\. The graph T has k — 1 more edges that are labeled by 
a2, . . . , a/j. 

As we assume T to be a core graph, there must either exist a loop edge at y or 
a second edge from x and y. W.l.o.g. we can assume that this edge is labeled by 
<Z2, see Figure [HI In either case, we cannot read the word a^a\ in T as the path is 
at the vertex y after reading a^, this proves the assertion. 

(2) In this case the fold must identify two non-loop edge with distinct endpoints, 
i.e. we must be in the situation of Figure [71 If a reduced word is read by a path 
in r then the vertex at which the fold is based can only occur as the initial and 
the terminal vertex of this path. As there is clearly a word of length 2 that cannot 
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Figure 6. A core graph that folds onto Ra with a single fold 

be read in the graph spanned by the remaining vertices (the word is a±ai in the 
example), the claim follows. □ 




Figure 7. A core graph that folds onto Ra with a spike-edge 

We are now able to prove a weak version of our main theorem. This weak version 
is an easy consequence of Stalling folds and the above observations. However, one 
of the key ideas used in the proof of the main theorem already shows up in the 
proof of the weak version. 

Theorem 4.4. Let k > m > 2 and let A = {ai, . . . , a^} and B = {&i, . . . , b m } 
be free bases of F(A) and F(B). Then there exist exponentially generic subsets 
V C C m> A and U C Ck.B with the following properties. 

Let (ui(b), . . . , itfc(6)) £ U and (vi(a), . . . ,v m (a)) E V be arbitrary and let G be 
given by the presentation 

(!) G = (ai, . . . ,a k ,b 1} ...b m \ a t = Ui(b),bj = Vj(a) for 1 < i < k, 1 < j < m). 

Then the k-tuple (a±, . . . , ajt) is not Nielsen equivalent in G to a k-tuple of form 
(&i, *,...,*). 

Proof. We can choose exponentially generic sets V C C m ^A and U C Ck.B so that 
the following hold: 

(1) For every (ui(b), . . . ,Uk(p)) £ U and (vi(a), . . . , v m (aj) € V the presen- 
tation (!) satisfies the C' (1/100) small cancellation condition, after sym- 
metrization. 

(2) For every (ui(b), . . . , Uk(b)) € U and (vi(a), . . . , v m (a)) G V we have \u%\b > 
10 10 and \Vj\ A > 10 10 . 

(3) For every (vi(a), . . . , v m (a)) € V and every j = 1, . . . , m, every subword of 
Vj of length > |«j|/100 contains all freely reduced words of length two in 
F(A) as subwords. 

We now argue by contradiction. Assume that for some G as in Theorem l4.4l the 
/c-tuple (ai, . . . , afe) is Nielsen equivalent in G to a fc-tuple (pi,*, ... , *). This implies 
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that (oi, . . • , a*;) is Nielsen equivalent in F(A) to a tuple T = (wx(a), . . . , Wk{a)) 
such that wi(a) = b\ in G. Note that we might have wi(a) ^ vi(a) in F(A). We do 
not distinguish between the Wi(a) and freely reduced words in F(A) representing 
them. After a possible conjugation of T in F(A), we may assume that tv-y is 
cyclically reduced in F(A) and that w\ is conjugate to b\ in G. 

By Corollarv l2.6l W\{a) must contain at least one fourth of a cyclic permutation 
of one of the defining relations {vj{a)b~ 1 ^ , since wi(a) is cyclically reduced and 
is conjugate to b\ in G. This implies in particular that w\ contains every freely 
reduced word of length two in F(A) as a subword. Let now St be the wedge as 
above and choose a sequence 

St = To, Ti, . . . , r n = Ra 

such that each i\ can be obtained from by a fold. Note that each Ti is a core 
graph. Indeed, I\ is obtained by a sequence of folds from Tq = St, which was a 
wedge of loops labeled by freely reduced words, and hence no vertex, other than 
possibly the base- vertex, has degree 1 in IV Moreover, the cyclically reduced word 
Wi can be read I\ along a closed path based at the base-vertex, which implies that 
the base- vertex has degree > 1 in IV Thus, indeed, I\ is a core graph. Hence T„_i 
is a core graph that folds onto Ra with a single fold and wi can be read by a closed 
path in IV This contradicts Lemma T4. 31 □ 

Remark 4.5. The proof of Theorem 14.41 recovers the well-known fact that for any 
cyclically reduced primitive element g in F(a\, . . . , a^) there exists some reduced 
word w of length two such that neither w nor ui _1 occur as a subword of g. 

We conclude this section with a generalization of Lemma 1431 that will be needed 
in the proof of the main theorem. 

Lemma 4.6. Let T be a finite connected A-graph of rank k + 1 that folds onto Ra 
but does not contain an isomorphic copy of Ra as a subgraph. Then there exists a 
reduced word that is not a label of a path in T. 

Proof. We may assume that any applicable fold in T produces a subgraph that is 
isomorphic to Ra since we could otherwise just apply a fold that does not produce 
such a subgraph. As any word that was readable before the fold is also readable 
after the fold, it follows that the assertion for the new graph implies the assertion 
for the original one. 

Since T is obtained from Core(T) by attaching a finite number of finite disjoint 
trees, any reduced word readable in T has the form z' zz" where z is readable in 
Core(T) and where \z'\ + \z"\ < diam(r). Thus, if w is a nontrivial freely reduced 
word such that w M is readable in T with M := 2diam(T) + 1, then w is readable 
in Core(T). Hence any reduced word that is not readable in Core(T) has a power 
that is not readable in T. We can therefore assume that T = Core(T). 

Thus r contains a graph ^> which either folds onto Ra or onto Ra with a spike- 
edge. Furthermore T is obtained from ^ by attaching a subdivided segment or a 
subdivided lollipop as described in Lemma HOI (1). 

There are two cases: Case 1, where ^ folds onto Ra, and Case 2, where VP folds 
onto Ra with a spike-edge. We will give all details in Case 1 and leave Case 2 to 
the reader. No new arguments are needed to complete the proof. 
Case 1: the graph \I> folds onto Ra with a single fold. Recall (see proof of 
Lemma r4.3|) that w.l.o.g. \f has two vertices x and y, a loop edge at x with label 
Oi, an edge with label a\ from x to y and k — 1 more edges with labels a%, ■ ■ ■ , a&, 
see Figure[5] T is obtained from ^ by attaching an segment, respectively a lollipop, 
consisting of, say, / edges. 
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In the following we will denote the attached segment, respectively the attached 
lollipop, by s. By the label of s we mean the label of the segment if s is an segment 
and otherwise the label of the path that starts and ends at the vertex where s is 
attached to ^ and walks once around the lollipop. We can assume that the label 
of s is freely reduced as we are only looking at freely reduced words that can be 
read in T. We distinguish three subcases depending on the size of I, the number of 
edges of s. 

Case 1A: Suppose first that I > 3. Then we cannot fold an edge of s with an edge 
of ^ as such a fold could not produce a copy of Ra contradicting our assumption. 
Note next that this assumption implies that the label of s cannot be a power of a\ 
as we could otherwise apply a fold to Y without producing a subgraph isomorphic 
to Ra- The same argument shows, that if s starts (ends) at x then the label of s 
does not start (end) with the letter af 1 . 

Then any path that reads a^ 1 must either end at the vertex x or I must be a 
lollipop with non-degenerate stick and the path travels exclusively on the candy of 
the lollipop which itself is labeled by a power of a\ , see Figure [8] 




Figure 8. The label of the lollipop s is a^a\aia 5 1 . The word a[ 
can be read in and also on the lollipop. 

If we now pick aj ^ af 1 such that a J is not the last edge on the stick of lollipop 
if the lollipop has a non-degenerate stick (choose ^ 05 in the example drawn in 
Figure [SJ, then any path that reads a^di 1 must end at vertex x. 

Now pick a letter a% such there is no edge of vff with initial vertex x and label 
a|. It follows that a path reading the word ajaj~ z a| must terminate with the initial 
edge of s or s -1 . It follows that the word Oja^ L a e k a^ cannot be read in Y if is 

such that a|a^ is not the initial word of the label of s (if s starts at x) and not the 
initial word of the label of if s ends at x. This proves the claim as it implies 
that a-a^ l a £ k a^ is not readable in Y. 

Case IB: Suppose now that 1 = 2. If s does not fold onto <I> then we argue as in the 
case I > 3. Thus we can assume that one of the two edges of s folds onto an edge of 
'I'. As such a fold does not identify x and y it follows that this fold must produce 
a new loop edge attached to either x or y and that there must have been already a 
wedge of k — 1 circles at x or y. We distinguish three different configurations. 

Suppose first that s is a segment of length 2 from x to y. In this case an edge of 
s must be folded onto an edge from x to y to produce a loop. 

If the fold produces a copy of Ra based at y then \f must have consisted of a 
single loop with label a\ at x, a single edge with label a\ from x to y and k — 1 
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loop edges with labels 02,..., a& at y. The fold must add a loop with label a\ 
which implies that the segment s from x to y has label a\a\. This means that the 
path a 1 ' 1 a2 is not readable in V as no path is at the vertex y after reading a -1 , see 
Figure [9] 




Figure 9. A copy of Ra emerges at y 

If the fold produces a copy of Ra based at x then $ had w.l.o.g. k — 1 loop 
edges based at x with labels ai, 03, . . . , cifc, an edge with label ai from 2; to y and 
either (a) a loop edge with label based at y or (b) an edge with label a\ from x 
to y. The fold must produce a loop edge with label ai at x which implies that the 
segment s from x to y must have label a\a\ in case (a) or a^ 1 or a\a\ in case (b). 
In all of these cases it is clear that a^'a^ 1 is not readable in T, see Figure [TOl 




Figure 10. A copy of Ra emerges at x 

Suppose now that s is a loop of length 2. As the fold must produce a loop-edge 
at x or y, it must fold an edge of s onto a loop edge. The descriptions of VP are as 
in the case before. 

If a loop edge is added at y then the added loop-edge must have label a\ , and 
hence s must have been a loop based at y with label a^a^ 1 with i > 2. In this 
case the word a 1 " 2 a2 is not readable in T. 

If a loop edge is added at x then the added loop edge must have label 0,2 which 
implies that s must be a loop based at x with label a^a^ 1 with i ^ 2. It follows 
that a\a^ x is not readable in T. 

Suppose lastly that s is a lollipop whose stick and loop both have a single edge. 
In this case we can argue precisely as in Case 1A. 

Case 1C: Suppose now that 1 = 1. We distinguish the cases where s is a loop 
edge and where it is an edge from x to y. 

Suppose first that s is a loop edge. If we add a loop edge with label a,j to x then 
there must still be a reduced word of the form a^af 1 (with i^l) that cannot be 
read in T. 

Indeed, a path with label a^af 1 must be at the vertex x after reading a^ 1 . 
Since by assumption T contains no copy of Ra, there is some 2 < i < k such that 
there is no loop edge labelled by Oj based at x. Moreover there are not two non-loop 
edges with labels <Zj and a^ 1 originating at x. The claim follows. 

The same argument works if we add a loop edge with label different from a± at y. 
If we add a loop edge with label a\ to y then inspecting the two possibilities reveals 
that there always exists a word of type a^a^ that is not readable as we assume that 
not all loop edges are present at y. 
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Suppose lastly that s is an edge from x to y. If the edge is labeled by af 1 then 
there is a word of type a\a e ^ that is not readable in T. Thus we can assume that the 
edge s is labeled by an element different from af 1 . This implies that any path that 
starts with aj -1 is at the vertex a; after reading aj" 1 . Thus we can assume that any 
af 1 can be read by an edge emanating at x. For at least one the corresponding 
edge leads to y. Hence the word a^ai cannot be read in V. □ 

5. Crucial consequences of genericity 

Definition 5.1 (Critical words). Let A = {ai, . . . , a k } where k > 2. In the follow- 
ing we will call an A-graph critical if it folds with a single fold onto a the rose Ra 
or on Ra with a spike-edge. We further call a word in F(A) critical if it can be 
read in some critical A-graph. 

By the lemmas established in the previous section, critical words are highly 
non-generic in F(A). 

For the remainder of the paper wc fix k > 2, m > 2 and the alphabets A = 
{ai, . . . , afe}. B = {&i, . . . , b m }. In this section we will study elements in a group 
of type 

(!!) G = (ai, . . . ,a k ,bx, . . . b m | a L = Ui(b),bj = v 3 (a) for 1 < i < k, 1 < j < m). 

that can be written as products (in G) of the form 

(**) Woh ei w\h E2 W2 ■ ■ ■ ■ ■ Wk-ih £k Wk 

where the it), are critical words, h is some fixed element of G and et G { — 1,1} 
for all i. We will see that there exist exponentially generic subsets V C C m> A and 
U C Ck,B such that for (ui(b), . . . , Uk(b)) € U and (vi(a), . . . , v m (a)) G V these 
products are well-behaved. The results we obtain are the main tools for the proof 
of the main theorem. We will denote by i?* C F(A U B) the symmetrized closure 
of the set of defining relations in (!!). 

We will not specify at the outset an explicit list of conditions for V and U that 
are exponentially generic and that will ensure that the subsequent results hold. 
Rather we will make the several required choices for V and U along the way in the 
proof, and each of U, V will be constructed as the intersection of a finite number 
of exponentially generic sets. 

However, at the start we do require V and U to satisfy the following conditions. 
We choose < A < 1CT 100 and < Ai < 1CT 100 A. 

Condition 5.2. The following hold for U and V: 

(1) There are exponentially generic subsets V C Ca and U C Cb such that 
U C U k and V C V m . 

(2) For every (ui(b), . . . , Uk{b)) G U and («i(a), . . . , v m (a)) G V the set i?„ 
satisfies the C"(Ai) small cancellation condition, after symmetrization. 

(3) For every u G U and v G V we have \u\ B > 10 100000 /Ai and \v\ A > 
10 100000 /Ai. 

(4) For every v G V, every subword of v of length > Ai(|f | + 1)/1000 contains 
all freely reduced words of length 1000 in F(A) as subwords. 

(5) For every u €U, every subword of u of length > Ai(|u| + 1)/1000 contains 
all freely reduced words of length 1000 in F(B) as subwords. 

(6) If some defining relation r G i?* contains two distinct occurrences of a 
subword y then \y\ < Ai|r|/1000. 

Proposition 13.61 implies that exponentially generic subsets V C C m> A and U C 
Ck.B with the above properties exist. 
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As there are only finitely many isomoprphism classes of critical A-graphs, and 
for any such A-graph "J there exists a word in F(A) that is not readable in "J , 
Lemma 14.31 implies that we may also assume the following: 

Condition 5.3. If y G F(A) is a freely reduced word that is readable in some 
critical A-graph such that y is also a subword of some defining relation r G R* then 
\y\ < AiH/1000. 

Definition 5.4 (Exceptional words). We say that a word h in F(A) is exceptional, 
if it does not contain a subword of length iV/100 that is also a subword of some 
r e R*. 

We say that a reduced word h in F(A U B) is a non- exceptional if h is not 
exceptional, that is, if either h contains some hi or h G F{A) and h contains a 
subword of length iV/100 that is also a subword of some r G i?*. 

We will mostly be interested in products as in (**) above where w, G F(A) are 
critical and where h G F(A U B) is non-exceptional. To understand the possible 
cancellations in such products we need a series of lemmas. 

Lemma 5.5. Let h a non- exceptional word in F{A U B). 

For any freely reduced critical word t G F(A) let W l be the freely reduced form 
of the word hth. We write W l as a reduced product W l = W^W^W^, where W{, 
W| and W| are the (possibly empty) portions of h, t, h accordingly that survive the 
free reduction of the product hth. 

There exists a critical word w G F{A) such that for each critical reduced word 
w ^ w the following hold: 

(1) The word W w does not contain a subword y of a defining relator r £ ii, 
such that y has overlaps with both and W™ in subwords of length at 
least lOOAiiV; 

(2) If in the free reduction of hwh some letter from B cancels then h = 
ubfrigr2b~ e v~ 1 such that the following hold: 

(a) u and v are words in F(A). 

(b) n and r2 are words in F{B) and |ri|, |t~2 | > lOOAi-ZV — 1. 

(c) W w = ubingrzbretfbfrxg^b^v- 1 , with W? = ublngr % bJ E , W$ = 
Oj , W% = b^r\gr2b^ E v and the word ^b^ 6 aJjbfri is a subword of a 
defining relator from , so that condition (1) above fails for the word 
w. 

(d) W w = ub\r\gf2f\gr2b~ e v~ x where for i = 1,2 fj is the remnant of 
r t and \n\ > 90AiAT,- thus Wf = ub E r ig f 2 , Wf = 1 and Wf = 
r\gr<2,b~[ £ v . Furthermore r2?i is not a subword of a defining relator. 

(3) In the free reduction of hwh no subword y of one of the two occurrences of 
h cancels such that y is also a subword of some defining relator in R* and 
\y\ > lOOAiTV. 

Moreover there is at most one critical word id such that (2) occurs. 

Proof. We first show that there is at most one word w such that the conclusion of 
either (1) or (3) fails. We can assume that h contains a subword of a relation of 
length at least lOOAiiV as there is nothing to show otherwise. 

Let Y be the right-most subword in h such that Y is a subword of some defining 
relator r G i?* and that \Y\ = 99Ai N. Similarly, let Y' be the left-most subword in h 
such that Y' is a subword of some defining relator from i?» such that = 99AiiV. 

In what follows, when referring to Y we think of it as a subword in the first h in 
hwh and when referring to Y' we think of it as a subword in the second h in hwh. 
Recall that a critical subword of a defining relator has length at most AiTV/100. 
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If case (3) fails, then a subword of Y of length at least 96Ai-/V must cancel with 
the subword of Y' of the same length in the free reduction of hwh. Therefore, if (3) 
fails for two distinct w\ ^ w 2 , then either Y has the form Y = Y\y 2 = yiYi with 
< \yi\ = Is/sal < 4A7V or Y' has the form Y' = Y{y' 2 = y[Y{ with < \y[\ = \y' 2 \ < 
4 AN . It follows that Y or Y 1 is a periodic word, contrary to our assumptions about 
generic properties of ti and V. Thus (3) can fail for at most one w. 

Suppose there exist distinct w\ and w 2 as in the statement of the lemma such 
that (1) fails for W\ and (3) fails for w 2 . Again, in the free reduction of hw2h, a 
subword of Y of length at least 96AiA must cancel with the subword of Y'. If 
in the free reduction of hw\h a subword of Y or Y' of length > 70AiA cancels, 
then we get a contradiction similarly to the above proof that (3) can fail for at 
most one w. Hence an initial segment s of Y of length |s| > 29AAi and a terminal 
segment s' of Y' of length \s'\ > 29AAi survive the free reduction of hw\h. The 
choice of Y and Y' now implies that each of s, s' is a subword of the word y, where 
y is a subword of some defining relator r. Recall, however, that since (3) fails for 
hw2h, the words Y and Y' almost cancel with each other and hence they are almost 
inverse. More precisely, a subword of Y of length at least 96Ai A must cancel with 
the subword of Y' of the same length. This means that there is a subword t of s 
of length |g| > lOAiA such that i -1 is a subword of s'. Thus both t and t^ 1 are 
subwords of r G R* which contradicts the small cancellation assumption about (!!). 

Suppose now that (1) fails for two different w\ and w 2 and let 2/i, 2/2, ?"2 be 
the corresponding subwords and defining relators as spelled out in (1). By the 
assumptions in Condition 15.31 the word W 2 i can have an overlap with of length 
at most Axlnl/1000. Hence \W 2 l \ < Ax^l/lOOO for i = 1,2. 

We claim that \\W^\ - \W™ 2 \\ < AiA/10 for i = 1,3. Indeed, if that were not 
the case, then (3) would fail for either w\ or w 2 - But we have already established 
that it is not possible for (1) to fail for one w and for (3) to fail for a different w. 
Thus indeed \\W^\ - \W™ 2 \ \ < Ai AT/10 for i = 1,3. The fact that the subword Vi 
of W Wi overlaps each of W™ 1 and W^' i in subwords of length > A/10, together with 
the small cancellation assumptions, implies that r\ and T2 are cyclic permutations 
of the same relator r G R*. We now claim that Wf 1 = Wp for j = 1, 2, 3. If this 
were not the case, then r would contain subwords of the form a(3j and a/3'j where 
M, M > A|r|/100, where \/3\, \j3'\ < Ai|^|/1000 and where fi ^ /3'. This contradicts 
the small cancellation assumption on (!!). Hence, indeed, (1) can fail for at most 
one w as in the lemma. 

We have shown that there exists at most one word w such that either (1) or 
(3) fails. Note that there is at most one word w such that in hwh some letter bf 1 
cancels. If w — w or if w or w with the above properties docs not exist then there 
is nothing to show. Thus we can assume that there is a word w such that either 
(1) or (3) fails and that for w some letter bf 1 cancels. We have to show that this 
puts us into situation (2). 

Suppose that (3) fails for w. Let b be the right-most occurrence of a letter bf 1 
in the first h. Then the left-most occurrence of a letter from B ±x in the second 
h must be 6 _1 since in hwh these occurrences must cancel each other. Suppose 
first that b is to the left of the mid-point of Y in h. Then Y, Y' G F(A). Then all 
of Y, Y' have to cancel in both hwh and in hwh. This is impossible by the same 
argument as in the proof that (3) can happen for at most one w. Thus b is to the 
right of the midpoint of Y in the first h. Hence b has to cancel both in hwh and 
hwh, which is impossible, as noted above. 

Suppose now that (1) fails for w. Again, let b be the right-most occurrence of 
a letter 6^1 in the first h. Then the left-most occurrence of a letter from J5 ±1 in 
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the second h must be b~ l since in hwh these occurrences must cancel each other. 
Since w ^ w, these occurrences of b and 6 _1 do not cancel in hwh. Recall that the 
freely reduced form W w = W^W^W™ of hwh contains a subword y such that y 
is also a subword of a defining relation r £ FL, and that y overlaps with both W™ 
and W™ in subwords of length at least lOOAiiV. 

Suppose first that, up to cyclic permutation and inversion, r has the form 
a^~ 1 u»(S). It follows that exactly one letter, namely af 1 survives in the free reduc- 
tion of the segment between b and b~ x in hwh and, moreover, both the occurrences 
b and & _1 are inside of y. It follows that W™ ends with a subword c\ and Wf 
begins with a subword C2 which are of length at least lOOAiiV and which are the 
terminal (initial) word of some Ui(b) . Now genericity of u% implies that not more 
than a subword of length lOAiiV can cancel in c\Ci which puts us into (2). 

Suppose lastly that up to a cyclic permutation and inversion, r has the form 
b~ 1 Vi(a). If b occurs to the left of the mid-point of Y in h then more than half of Y 
cancels in hwh, which yields a contradiction as in the above argument that (3) and 

(1) cannot happen for different values of w. Hence b occurs after the midpoint of 
Y in the first h and, similarly, b~ l occurs before the mid-point of Y' in the second 
h. It follows that in order for (1) to occur at least one and therefore both of b and 
6 _1 must cancel contradicting the fact that letters bf 1 cannot cancel for different 
words. The last part of the proof implies the more detailed description of (2). 

This completes the proof of Lemma [5751 □ 

Lemma 5.6. Let h £ F(A U B) be a non- exceptional do-geodesic word and let w 
be as in the conclusion of Lemma \5.5l There exists a non- exceptional reduced word 
h £ F(A U B) such that h = h in G such that the following hold: 

(1) The conclusion of Lemma \5.5\ applied to h holds for the same w as before. 

(2) The word h does not contain a subword of a defining relator of length more 
than -^N. 

(3) For any n £ N the freely reduced form of (hw) n h does not contain a subword 
of a defining relator of length more than j^jN . 

(4) If h contains a subword of some defining relator r £ -R* of length N/100 
then in (hw) n h both the first and the last occurrence of h have subwords of 
length N/500 of some element of that survive the free cancellation (of 
{hw) n h). 

Proof. We start with h and then modify this word while preserving properties (1), 

(2) above and the fact that the word is non-exceptional. The final word obtained 
in this process is h. 

Choose u maximal such that the freely reduced form of hw has he form uhu^ 1 . 
Since h is non-exceptional and w critical, it follows that there exists h such that 
(hw) n h = uh n h (as words). Then h — uhv' 1 and therefore (hw) n h = u/i n+1 w _1 
(both as words). 

We first show that if (3) does not hold for h then we can find a subword in 
hwh that is a subword of a defining relator whose length is more than j§jN. Wc 
distinguish the cases the u is critical and that it is not. 

If u is critical then uh n h differs from h n+1 only by multiplication from left and 
right by at most 2 critical words. Suppose that uh n h contains a subword t of a 
defining relator of length ^ N that is not readable in uh n h. We assume that n is 
chosen minimal for t. If n > 3 then t is periodic, contradicting genericity. If t = 2 
then t contains a subword of length |JV twice, contradicting the small cancellation 
hypothesis. It follows that t < 1, as desired. 

If u is not critical then h = uhv^ 1 and therefore (hw) n h = (both as 

words). Now any subword of (hw) n h = M/i n+1 w _1 that cannot already be seen as 



20 



I. KAPOVICH AND It. WEIDMANN 



a subword of uh 2 v~ l would either contain a significant (in relation to N) suffix 
of u and a significant prefix of v~ x or would contain a significant subword of h n 
which is not a subword of h? . If the former happens then u and v must have a 
substantial common suffix as they only differ by left multiplication with a critical 
word, thus the relator must have a long subword that occurs twice, contradiction 
the cancellation properties of the presentation (!!) of G, and if the latter happens 
we argue as in the case of critical u. 

Thus we can assume that either (3) holds or that uhhu' 1 contains a subword 
that is a subword of a defining relator r whose length is more than j^qN. Genericity 

and (2) now imply that h = W1W2W3 such that W3W1 is a subword of r e R* 
of length at least j^N and that \Wi\ = \W 3 \, in particular r = W3W1W with W 
of length at most j^N. We replace h with h = WiWiW^W^ which does not 
change the element of G represented by the word. After free reduction of uhu~ l w~ l 
we obtain a new non-exceptional word that by construction satisfies (2). It also 
satisfies condition (1) as the modification was only applied if the conclusion of (I) of 
Lemma [5751 did not hold for w and in that case the conclusion of (3) of Lemma 15.51 
does not hold for w after the modification. 

Assertion (4) follows immediately from the above argument. This is true as (4) 
holds for the initial h and is preserved under modifications that are made. □ 

Lemma 5.7. Let h be as in the hypothesis of Lemma \5. 5\ and let w be a non-trivial 
freely reduced critical word. 

Let further W be the word obtained by freely reducing the product hwh . Write 
W as a reduced product W = W1W2W3, where W\, W2 and W3 are the portions of 
h, w and h~ x accordingly that survive a free reduction of the product hwh~ l . 

Then the following hold 

(1) The free cancellation does not involve a letter bf 1 of h^ 1 or a subword y 
of h^ 1 of length \y\ > Ai A/10 such that y is a subword of some defining 
relator. 

(2) Let y be a subword of W such that y is also a subword of some defining 
relator r that overlaps both W\ and W3 in W . Then the shorter of these 
two overlaps has length < Ai AT/10 and \ W 2 \ < Ai AT/1000. 

Proof. (1) The first claim is obvious while the later follows from the assumptions 
in Condition 15.21 and Condition 15.31 

For part (2), suppose that y overlaps both W\ and W3. Hence W2 is a subword 
of y and hence of r. Therefore Condition 15.21 implies that | W2 1 < Ai|r|/1000. 
Suppose that (2) fails and that the overlaps of y with each of Wi, W3 have lengths 
> Ai|r|/10. Thus y = J/1W2J/3 where y\ is a terminal segment of Wi, where 2/3 is an 
initial segment of W 3 and where \y 3 \ > Ai|r|/10. Recall that W = W1W2W3 
is the freely reduced form of hwh^ 1 . Since | W2 1 < Ai|r|/1000, the definitions of 
Wi, Wi, W 3 then imply that there is a subword yo of y\ of length |y | > Ai|r|/30 
such that t/q -1 is also a subword of y 3 . Hence r contains two disjoint occurrences of 
yo and y^ 1 , where \yo\ > Ai|r|/30. This contradicts the genericity assumptions in 
Condition [Ql □ 

We now have all the tools necessary to study products of type 
woh ei wih E2 ui2 ■ . . . ■ Wk-ih Sk Wk 

where h is non-exceptional and all Wi are critical. Possibly after replacing h by 
another non-exceptional word as in Lemma [5.6[ we may assume that the conclusions 
of Lemma l5.6l and Lemma [53] and hold with w provided by Lemma 15.71 Thus the 
following lemma covers all cases. 
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Lemma 5.8. Let h be non- exceptional word and w be a critical word such the 
conclusions of Lemma \5.5l Lemma \5.6\ and Lemma \5. 7| hold for h and w. 

Letp = woh ei wih S2 W2 ■■ ■ ■■Wk-ih ek Wk where all Wi £ F(A) are critical. Suppose 
that for some bi G B the word p represents an element of G conjugate to hi in G. 
Then p is conjugate to bi in F(A U B). 

Proof. If p cyclically reduces to bi in F(AlJ B), there is nothing to show. Thus 
we can assume that this does not happen. We denote the word obtained from the 
freely reduced form of p by cyclic reduction by p. It now follows from Lemma 12.71 
(3) that pp contains a subword that is ±qqq of some relation from R*. 

We will show that this is not possible. We distinguish two cases, namely the 
case that h contains a subword of length j^qN that is also a subword of a defining 
relation and the case that it does not contains such a subword but contains some 
letter from B ±1 . Since h is assumed to be non-exceptional, these two cases cover 
all the possibilities. 

Case 1: Suppose first that h contains a subword of length N that is also a 
subword of a defining relation from Recall that w is chosen as in Lemma \5. 51 
In the following we will denote the word obtained from (hw) n ~ 1 h by free reduction 
by h n . 

We may assume that p is of the form 

p = h^wi ■ w k -ih £ £wk 

where £j = ±1 for 1 < i < k and that Wi are critical words such that Wi ^= w s ' 
if Si = £j+i, otherwise we could replace the subword h £ j i i ' w ihj* +1 1 = hjl^'h'ji ! by 
hj i+ j i+1 . We may further assume that this product is freely reduced, i.e. that 
Wi / 1 if £j = — £j+i, and that it is cyclically reduced, i.e. that either £\ = Sk or 
that £\ = —Ek and ^ 1. 

By Lemma 15.61 holds, the word h n does not contain a subword which is more 
than yj|j of a defining relation. By part (4) of Lemma 15.61 any h n contains two 
subwords of relations of length at least |r|/500 inherited from the first and last 
occurrence of h by the free cancellation of the product (hw) n ~ 1 h. In particular we 
can choose words U and V that are the leftmost, respectively rightmost, subwords 
of h n that represent of a defining relation. Then these words are the same for 
all h„ with n > 1. 

It now follows from Lemma 15.51 and Lemma 15.71 that none of the occurrences of 
t/ ±1 and V ±x (which are subwords of the h^*) cancel completely in the free and 
cyclic reduction of p 2 to p 2 . 

It also follows that the remnants of the and coming from two adjacent 
h E / and h e / +1 do not lie in a subword of p 2 that is also a subword of a defining 

Ji Ji + 1 * ° 

relation. 

Since the words Wi do not contain subwords that are greater than 10 q 00 portions 
of defining relations, it follows that any subword of a defining relation that occurs 
in pp is of length at most ^ + 2^Lj + 2 10 q Q0 of that relation. Since this number 
is smaller than ^jj^ , it follows that p cannot represent an clement of G conjugate 
to bi in G. 

Case 2: Assume now that h contains a letter of B^ 1 but docs not contain a 
subword of length N that is also a subword of a defining relation. 

Let u be the maximal prefix of h that is a word in F(A) and let u -1 be the 
maximal suffix of h that is a word in F(A). Thus either h = ubfwtfjv' 1 or h = 
ubfv -1 . Let w be the unique reduced word in F{A) such that v~ l wu reduces to 
the trivial word. 
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Now let Z n obtained by the word obtained from (hw) n h by free reduction. Then 
Z n does not contain a subword of length j^N that is also a subword of a defining 
relator as such a word would need to be periodic because of the assumption that h 
does not contain more than of a relation. 

Moreover, the word Z n has a prefix vh\ and a suffix tijV -1 and hence Z n contains 
a letter from B ±x . We can assume that p is of the form 

p = Z £ ^w 1 ■ w k -iZj*w k 

where £j = ±1 for 1 < i < k and the Wi are reduced critical words such that Wi ^ 
w 6i if Si = £j+i; otherwise we could replace the subword Z^WiZ^^ = Z^w Ei Z^ 

by Zji+ji+i- 

Therefore at least one letter from B ±x of each Zj* and one letter from A^ 1 of each 
Wi do not cancel in the free reduction of p. Thus, since each of these factors does 
not contain more than ^ of a defining relation, the freely reduced word obtained 
from p 2 cannot contain i of a defining relation. This contradicts the fact that p 2 
is conjugate to b\ in G, thus ruling out Case 2. □ 

Lemma 5.9. Let z be a reduced word in F(A U B) and suppose that z £ F(A) is 
an exceptional word representing the same element in G as z. Then the following 
hold: 

(1) There exist words ao, . . . , a<, f3\, . . . , f3 t (nontrivial all except possibly 
and at ) and words (3\ , . . . , /3 t such that the following hold: 

(a) We have z = ao/3i«i ■ • ■ Ptctt (equal as words). 

(b) We have z = aofSiai ■ • ■ Ptott (equal as words). 

(c) We have < and > 10007V for all i, where N is the 
length of the defining relations of G. 

(2) If u and u are maximal words such that z = uwu~ l and z = uwu -1 (equal 
as words) for some words w and w then \w\ < \w\. 

(3) Ifa^ is a letter of u such that neither this occurrence of all nor corresponding 
the corresponding occurrence of a^ v in fall in subwords /3j of z then in 
z the letter occurs in u and the letter a" 1 ' occurs in u . 

Proof. Recall that the presentation (!!) for G satisfies the C'(Ao) condition with 
A = 1/1000. 

By a Xo-reduction on a word in the generators of G we mean replacing a subword 
of that word, that is also a subword of one of the defining relations r of length 
> (1 — 3Ao)|r|, by the complementary portion of r. Recall that, by Definition 12. 31 a 
freely reduced word in F(AUB) is X Q -reduced if it does not admit any Ao-reductions. 

Note that by definition of being exceptional, the word z € F(A) is Ao-reduced. 
Observe now that if z is any other freely reduced and Ao-reduced representative 
of z then z — z as words. Indeed, if z ^ z then the equality diagram for z =q z 
has the form provided by Proposition 12.41 and, also by Proposition 12.41 the word z 
contains a subword of a defining relation of length > 1/100 of that relation. This 
is impossible since z is Ao-reduced. 

Thus any maximal sequence of free and Ao-reductions applied to z must ter- 
minate with the word z, and z is the unique freely reduced and Ao-reduced word 
representing the same element as z in G. 

Let z =q uwu~ l be the representation of the element z € G provided by 
Lemma 12.71 Recall that by Lemma 12.71 the element of G represented by the word 
w is the shortest (in G) representative of the conjugacy class of z, the word uwu -1 
is freely reduced and Dehn reduced (in particular, Ao-reduced). Since both z and 
uwu -1 are freely reduced and Ao-reduced, it follows that z = uwu^ 1 as words. 
Moreover, we can obtain z from z by applying Ao-reductions and free cancellations. 



NIELSEN EQUIVALENCE IN SMALL CANCELLATION GROUPS 



23 



Note that since the element represented by the geodesic word w is shortest in the 
G-conjugacy class of z, part (2) of Lemma I5U1 is now established. 

Every Ao-reduction replaces a subword Q by a word of length < |Q|/100. A free 
reduction replaces a subword of positive even length by an empty word. Recall 
that z is the unique freely reduced and Ao-reduced representative of z. Moreover, 
since z ^ z, at least one Ao-reduction is necessary to get to z from z. Hence, 
by an inductive argument, we can show that z and z can be represented as z = 
aoflioti . . . t a t , z = aofiiai . . . fitat, where |ft| < jgolftl, where ft = G ft and 
where the process of Ao-reducing z to z consists in Ao-reducing each of ft to ft 
separately. This implies part (3) of Lemma 15.91 

The fact that |ft| > 1000A requires a more involved argument which we sketch 
below, omitting some of the technicalities. 

The process of Ao-reducing z to z can be conducted in stages. At the first stage 
we find a maximal collection of non-overlapping subwords y q , q = 1, 2, M, of z 
such that each of them is also a subword of some defining relator bj = Vj (a) (or a 
cyclic permutation of this relator or its inverse) of length comprising > (1 — 3Ao) 
fraction of that relator. Since z is a word in F(A), each y q is also a word in 
F(A). We then replace each of these subwords y q by the missing portions of the 
corresponding defining relators, which introduces M letters from {b\, . . . ,b m } 
into the word, and denote the result by z' . We then freely reduce z' to get a word 
Zi which is the result of the first stage. Note that no b^ 1 from z' get cancelled 
in freely reducing z' to z\ since otherwise one could show that the word z was 
not freely reduced. Also, we have t < M and each of y q is a subword of some ft 
from the conclusion of the lemma. Recall that the Ao-reduced form z of z does not 
involve any 6^1. 

In the word z\ obtained after the first stage the only applicable Ao-reductions 
are those that involve large portions of the defining relators a,j = Uj (b) (with some 
technical exceptions which we suppress for the purposes of this sketch). 

At the second stage, we apply a similar process to Z\ by finding a maximal 
collection of disjoint subwords x s of Z\ such that each x s is also a subword of some 
relation a,j = bj(b) comprising > (1 — 3Ao) fraction of that relation. Thus each x s 
is, except for a single letter, a word in {h, 6„ 1 } ± 1 of length > 0.99A. When 
tracked back to z, this causes, for each x s , a conglomeration of > 0.99JV of the 
original y q s into a single ft of length > (1 — 3Ao) N ■ 0.997V > 1000A for sufficiently 
large TV. Some of the bj introduced in z\ at stage one may not be contained in any 
of the x s at stage two. 

However, eventually all of the bj that were introduced in z\ at stage one have to 
disappear in the Ao-reduction process (since the word z involves only {ai, . . . , a/ £ } ±1 ) 
and one can show that the condition |ft | > 1000A holds for every i = 1, . . . ,t. 

a 



6. Proof of the main theorem 

This section is dedicated to the proof of the main result of the paper, Theorem ll.il 
from the introduction. While the strategy of the proof is similar to the strategy of 
the proof of Theorcm l4.4l the argument turns out to be significantly more involved. 

We will prove the following theorem which is the precise formulation of Theo- 
rem |TTT| from the Introduction. 

Theorem 6.1. Let k > 2, m > 2 and let A = {ai, . . . , a^} and B = {bi, . . . , b m } 
be free bases of F(A) and F(B). Then there exist exponentially generic subsets 
V Q C m ,A and U C Ck,B with the following properties. 
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Let (u\(b), . . . , Uk(b)) G U and (vi{a), . . . ,v m (a)) € V be arbitrary such that 
\ui\ = \vj\ for 1 <i < k and 1 < j < m. Let G be given by the presentation 

(!!) G = (oi, ...,ak,bi,...b m \ai = Ui(b),bj = Vj(a) for 1 < i < k, 1 < j < m). 

Then the (k + 1) -tuple (a\, . . . , a^, 1) is not Nielsen equivalent in G to a (k + 1)- 
tuple of form {b\, 62, *,•••, *)• 

The proof of Theorem l6.1l is by contradiction. Thus we assume that the (k + 1)- 
tuple (ai, . . . , o/s, 1) is Nielsen equivalent in G to a tuple of type (b\, b%, *, ■ ■ ■ , *). 
It follows that in F(A) the tuple (ai, . . . , a&, 1) is Nielsen equivalent to a tuple of 
type (wi(a), . . . , Wfe(a), Wk+i{a)) such that bi = Wi(a) in G for i = 1,2. 

Let T = (u>i(a), . . . ,Wk(a),Wk+i(a)) be a tuple with the above properties such 
that \wi \a + \w2 \a is minimal among all such tuples. 

Let St be the ^4-graph as in Section 2] and choose a sequence 

St = To, Ti, . . . , T/ = Ra 

of ^4-graphs such that r, is obtained from Ti-i by a Stallings fold for 1 < i < I. 
Choose q maximal such that T q does not contain a subgraph isomorphic to Ra as 
an A-graph. Put A := T q . Then A contains a subgraph \I> of rank k with at most 
k + 2 edges such that \& is a core graph and that ^, via a single fold, folds onto a 
subgraph of T q+ i that is isomorphic to either Ra or to Ra with a spike-edge, see 
the proof of Lemma T4. 31 Note that any word readable in ^ is critical. 

Note that Core(A) cannot coincide with \1/ by the same argument used in the 
proof of Theorem 14.41 Indeed, if Core(A) = "f, then by Lemma [4.31 there exists 
a reduced word of length 4 that cannot be read in Core(A). On the other hand, 
the cyclically reduced form of wi(a) can be read in Core(A) = Core(T q ). Since 
u>i(a) is conjugate to b\ in G, it follows that the cyclically reduced form of w\(a) 
must contain a large portion of some defining relation Vj(a)bJ . Hence, by the 
genericity assumptions on vj, the cyclically reduced form of u>i(a) contains every 
freely reduced word of length 4 as a subword, yielding a contradiction. Thus indeed 
Core(A) ^ 

In the following a lollipop is a loop (candy) with an segment (the stick) attached. 
We will allow the stick to be degenerate, i.e. the lollipop to be a loop. 

Lemma 6.2. The A-graph A has the following properties: 

(1) The subgraph A = Core(A) carries the fundamental group of A and one of 
the following holds (see Fiaure \Tl\) : 

(a) A can be obtained from ^ by attaching a (subdivided) segment 7 along 
both endpoints to distinct vertices of ^ . 

(b) A can be obtained from by attaching a (subdivided) lollipop along 
its unique valence one vertex if the stick is non- degenerate and an 
arbitrary vertex otherwise. 

(2) Either the base-vertex of A lies in A in which case A = A, or A is obtained 
from A by attaching a segment to A along one of its endpoints. The other 
endpoint of the segment is then the base vertex Xq of A. 

Proof. Part (1) follows immediately from the fact that A is a core graph of rank 
k + 1 and that is a core subgraph of rank n. Part (2) follows as any vertex of 
St = To i except possibly the base vertex, is the initial vertex of two edges with 
distinct labels. This property is preserved by folds and therefore holds for A. The 
claim is now immediate. □ 
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FIGURE 11. The two options for the graph A 



Note that in Figure QT] the graph \1/ consists of the fat lines. A trivial but 
important fact is that any of the words Wi (a) can be read by a path in A based at 
the base vertex since the map p from St to Ra factors through A. This path is 
clearly reduced as the words Wi{a) are reduced. 

Convention 6.3 (Pivotal path and pivotal word). Let the word z £ F(A), called 
the pivotal word, be defined as follows: In the case (l)(a) of Lemma 16.21 put z to 
be the label of the segment attached to \&. In the case (l)(b) of Lemma W% put z 
to be the label of the closed reduced edge-path in A based at the attaching point 
of the lollipop that walks once around the lollipop, i.e. it is a word of type uwu~ x 
where u is the label of the stick of the lollipop. 

Thus in both cases z is the label of a path Z in A which we call the pivotal path. 

Let now z be a geodesic word in G over [A U B) ±x such that in G we have the 
identity z =q z. We will distinguish two cases, namely that z is exceptional and 
that z is non-exceptional. 

Case A: The word z £ F(A U B) is non-exceptional. 

Possibly after replacing z with another non-exceptional word representing the 
same clement we can assume that for some unique critical word w the conclusions 
of Lemma Lemma 15.51 Lemma 15.61 and Lemma 15 . 71 hold for z and h. 

Let now x\ be a base point in ^ that is incident to the segment or lollipop with 
label z. We study elements represented by closed reduced paths in A based at 
x\. It follows from the construction that for i = 1,2 there exist closed reduced 
paths oil, «2 in A based at x\ with labels pi,f>2 such that for i = 1, 2 the word pt 
represents an element of G conjugate to bi in G. The description of A in Lemma 16.21 
and the definition of z imply that p\ and pi are words that are freely equivalent to 
words of type 

w z £l wi ■ .. . ■ W n -iZ 6n W n 

where the Wi are reduced words read by reduced paths in ^ and Si G { — 1,1} for 
all i. In particular the words Wi are critical. As in G we have z = z, it follows that 
there are also words p\ and pi of type 

WQZ ei W\ ■ . . . ■ w n -iz en w n 

with the Wi as before that represent elements conjugate to b\ and 62. Note that 
the obtained words might no longer be freely reduced even if the initial ones were. 

Suppose now that p\ = wqz £i w\ ■ . . .■w n -iz en w n . It now follows from Lemma [5.8l 
that px cyclically reduces to b\ in F(A\J B). The proof of Case 2 of Lemma [5.81 
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implies that of each factor z^ 1 at least one letter bf 1 does not cancel. This im- 
plies that wiz^wi must cyclically reduce to b\ where W\ and W\ are words in 
F(A). Therefore z contains a single letter of type bf 1 , namely bf 1 . The same 
argument applied to P2 implies that z contains bf 1 which yields a contradiction. 
This concludes the proof of Theorem 16.11 for Case A. 

Case B: The word z is exceptional (so that, in particular, z £ F(A)). We distin- 
guish the cases that z = z and that z ^ z (as words). 

Subcase Bl: Suppose that z ^ z. The aim in this case is to get a contradiction 
to the minimality assumption on the sum of the lengths of w± and W2 ■ The under- 
lying idea is simple: we just replace in A the segment (respectively lollipop) with 
label z by an segment (respectively lollipop) with label z and then carry out the 
corresponding replacement in the loops of St upstairs. Since in this case the word 
z is considerably shorter than z, this yields a contradiction with the minimality 
assumption on the pair (wi, u^)- However taking the basepoint into account makes 
the proof more technical. 

Since Lemma 15.91 applies to the transition from z to z, we choose the words /3, 
and a, as in the conclusion of Lemma l5.9l Part (lc) of Lemma l5. 91 implies that z is 
significantly shorter than z, namely by at least ^ |/3,| > 990 • k ■ \vi\. We denote 

EIAI byi. 

Any vertex x of the pivotal path Z, except the initial and terminal vertex of 
Z, corresponds to a subword of length two of z which is read along the two-edge 
subpath eie2 of Z when reading z, where x = t(e\) = o(e2). In the case of the 
lollipop and the vertex lying on the stick there are two such two-letter subwords of 
z associated with the vertex x. 

We say that a vertex on the pivotal path Z is inessential if none of these at most 
two subwords is a subword of one the Pi. Note that, by definition, the initial and 
the terminal vertices of Z are inessential. 

We perform some changes to St and A; these changes do not alter the following 
facts: 

(1) St folds onto A and A folds onto Ra- Thus St folds onto Ra- 

(2) The first two loops of St are labeled by words in the af 1 that represent b\ 
and &2- 

The conclusion then follows from the fact that after the final modification the 
length of the first two loops of St has decreased. 




Figure 12. Fat parts correspond to paths read by the /3j 
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Let y be the vertex of A = Core(A) such that d(vo, y) = d(vo, A). Thus y = vq 
if «o G A and y is the attaching point of the segment joining A and «o otherwise. 
We now choose a new vertex y' 6 A so that the following hold (see Figure IT2j) : 

(1) The vertex y' is inessential. 

(2) We have d{y, y') < 2L. 

(3) If y is in distance less than L from the stick of the lollipop then y' is on the 
stick of the lollipop (if we are in the lollipop situation) . 

The existence of such a vertex y' follows immediately from the definition of L. 
From now on we only provide figures for the lollipop situation as it is the more 
subtle one. 

Let [y', y] be an arbitrary geodesic in A. We now unfold along [y', y] (starting at 
y). In the end vo is attached to A by a segment of length d(vo,y) + d(y, y') which 
intersects A only in y'. We denote the just constructed unfolded A by A+. Note 
that we have not changed "J or Core(A) since Core(A + ) = Core(A) = A. 




Figure 13. Unfolding A to A + 

Now St does not necessarily map onto A + but it does if we introduce at the 
beginning and at the end of each petal of St the trivial word that corresponds to 
the unfolding when going from A to A + . More precisely: 

If u is the word corresponding to [y, Vq] in A and v the word corresponding to 
[y',y] in A then the i-th petal in St has a label of type u~ l diU for some word 
di. This happens because the i-th petal in St gets mapped onto a path in A that 
starts and ends at vq, after it starts (before it ends) at vq it has no other option 
than going to y first (or must come from y). 




Figure 14. A loop of St 

We replace the i-th petal of St with a petal with label u~ 1 v~ 1 vdiV~ 1 vu. These 
new petals represent the same group elements as the old ones and now map to 
A + since the word u^v" 1 maps to the edge path [i>o,Z/'] in A + and the word 
u~ 1 v~ 1 v onto [uo>2/'] U [y',y] m A+. We further modify the newly constructed 
graph by performing the free reduction on the subword vdiV -1 that corresponds 
to backtracking on the segment, respectively lollipop. Thus we are left with petal 
labels of type u~ 1 v~ 1 pidiq~ 1 vu where pi is the prefix of v and q~ l is the suffix of 
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Figure 15. Constructing Sji + (only one loop drawn) 

Note that for i = 1, 2 the i-th loop of still represents the element bi in G. By 
construction, since d(y,y') = \v\ < 2L, we have increased the length of each petal 
by at most 8L. 

Now the subword Pidiq' 1 of the i-petal label in Sj, corresponds to a closed path 
in A based at y' and is therefore of type 

(4k) 9\"fouw ri u~ 1 ^i- ■ ■ uw ri u~ 1 ^fi92 1 

where: 

(1) The words 9\ and #2 correspond to reduced paths from y 1 to <J . In the case 
of a lollipop these paths might go around the lollipop a number of times. 

(2) The words 7 2 ; are read in ^> and are non-trivial for 1 < i < I — 1. 

(3) r, e Z - {0} and G {— 1, 1} if 2 is the label of an segment rather than 
the label of a lollipop. 

We now show that I > 20 for i — 1, 2. Indeed, after conjugating the word in (4k) 
by $2~ > we obtain the word 

Replacing z Ti = uw ri u~ 1 by z Ti = uw Ti u~ 1 and freely reducing the result produces 
a word in F(A) that still represents a conjugate of &i (or 62) and must therefore 
contain half a relation after free reduction. Now this word consists of the remnants 
of the ji and the uw ri u~ 1 . Now the 7^ arc critical and can therefore not contain a 
significant subword of a defining relation. Moreover the uw Ti u~ 1 do not contains 
more than ^-th of a defining relator by the assumption onz = uwu^ 1 and the fact 
that a subword of a defining relator cannot be periodic. It follows that I > 20. 

We now construct a new ^4-graph A* from A + . We first replace A = Core(A + ) 
by replacing the segment, respectively lollipop, with label z as follows. In the case 
of an segment, we replace it with an segment with label z and the same attaching 
points as before. In the case of a lollipop we replace the lollipop with a lollipop 
whose stick is labeled by u and whose loop based at the stick is labeled by w. In 
both cases walking along the segment or around the lollipop reads z = uwu -1 . Let 
A* be the modified A. 

Since the vertex y' was chosen to be inessential, it corresponds to a unique vertex 
of A* which we call y*. We now define A* to be the A-graph obtained from A* by 
attaching the segment [i>o,z/] (from A) to A* by identifying y' and y*. Note that 
in the lollipop case if y' was on the stick of the original lollipop then by part (3) 
of Lemma 15.91 is on the stick of the new one but the converse is not necessarily 
true. The A-graph A* still folds onto TZa since A* still contains ^> as a subgraph. 
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Figure 16. In A* the point y' = y* could move to stick 

Next we need to adjust S£ so that it maps onto A*. Recall that the labels of 
the petals in S£ are of type u~ 1 v~ 1 pid~iq~[ 1 vu where Pifaq~ x is of the form 

9ijouw Tl u~ 1 ji ■ ■ ■ uw ri u~ 1, fi82 1 

with 6*i and 62 words read by paths that start at y' and end in 'J. We will not 
change the initial subword u~ l v~ 1 or the terminal subword vu, nor will wc change 
the subwords 7*. We first replace the subwords uw Ti u~ 1 by uw ri u~ 1 . This reduces 
the length of the word by at least 20 • j^L. 

We will now replace 9\ and #2 by words 9\ and O2 that correspond to paths in A* 
whose initial vertex is y*, which have the same terminal vertices in as 9\ and 82 
and which are at most as long as 9\ and 62, respectively. We give the construction 
for 61, as the case for 9 2 is analogous. 

Since 6\ is freely reduced, it is a suffix of the word uw r u~ 1 for some r € Z with 
r G { — 1, 1} in the segment case. We choose r such that \r\ is minimal, w.l.o.g. wc 
can assume that r > 0. We distinguish a number of cases: 

If \r\ = 1 then 9\ is a suffix of z and the conclusion follows after replacing all 
subwords fa of 6\ by /?-, our choice of y' guarantees that that each fa either is a 
subword of 9\ or does not overlap with 9\ . The resulting word 9\ is clearly at most 
as long as 6\. This case deals in particular with the non-lollipop case. 

If 1 7- 1 > 2 and B\ = tw r u~ 1 , i.e. if y' lies on the stick of the lollipop, then y* 
also lies on the stick of the lollipop corresponding to uwu -1 by Lemma [5.91 (3) 
and we replace 9\ by iw r u~ l where twu -1 is the word obtained from turn" 1 as in 
the case r = 1. It follows that the resulting word 9\ is as most as long as 9\ as 
< \twu~ x \ and as \w\ < \w\ by part (2) of Lemma 15.91 (2). 

The remaining case is that 9\ = sw r ~ 1 u~ 1 where s is a suffix of w. Thus 
9\ = w r ~ 1 su~ 1 where w is the word corresponding to the loop of the lollipop based 
at y' . Let now 9 be the word obtained from su _1 as in the case r = 1 and w' be 
the path corresponding the closed path at y* in the new lollipop that goes once 
around the lollipop. Note it follows from the fact that y* is at least in distance L 
from the stick of the lollipop that \w'\ < \w\. We put 9\ to be the word obtained 
from (w') r ~ 1 9 by free reduction, clearly it is not longer than 9\. 

Let now be the graph obtained from 5^ by performing the above mentioned 
changes. The construction clearly preserves the fact that the map from S?p to A* 
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is 7Ti-surjective. Thus S T folds onto A* therefore onto Ra as A* fords onto Ra- In 
particular the loops of S T represent a tuples that is equivalent to (01, . . . , Ofc, 1) such 
that the first two elements represent in G the elements b\ and &2- Note however 
that the sum of the lengths of the first two petals has decreased as we first increased 
their lengths by at most 8L and then decreased their lengths by at least 20 ■ j^qL. 
This yields a contradiction with the minimality assumption. 

Subcase B2: We have z = z (so that z € F(A)). 

Recall that k is the cardinality of the free basis A of F(A). If the length of z 
is less than 1000/c 2 then there are only n = n(k) possibilities r' l5 . . . , ^' n r k \ for the 
A-graph A = Core(A). By Lemma 14.61 for each i = 1, . . . , n(k) the set of words 
readable in T[ is exponentially negligible in F(A). Since the union of finitely many 
exponentially negligible sets is again exponentially negligible, it follows that there 
exists an exponentially generic subset of F( A) (independent on the choice of z) such 
that for every word w from that subset no portion of w that is more than 1/100 of 
the length of uu can be read in Core(A). We can then get a contradiction similarly 
to the proof of Theorem 14.41 Namely, by construction there exists a freely reduced 
word w <E F(A) (labelling one of the petals in the graph St) whose cyclically 
reduced form w can be read in Core (A) and such that the element represented by 
to is conjugate to b 1 in G. Then by Corollary 12.61 the word w 6 F(A) contains 
a significant portion of a defining relation from R*, contradicting the genericity 
assumptions on presentation (!!). 

Suppose now that the length of z is at least lOOOfc 2 . Then, again arguing as in the 
proof of Theorem 14.41 we see that there exists a cyclically reduced word w <E F(A) 
that can be read in Core (A) such that w represents an element conjugate to b% in 
G. Then, by Corollary 12.61 w contains a subword w such that w is also a subword 
of one of the defining relations in (!!) constituting more than 1/100 of that defining 
relation. Thus, on one hand, the word w is generic and, on the other hand, w is 
equal, as a word, to a product of the form 

WqZ^WiZ^ 1 . . . WfZ^Wf 

where the words Wi are readable in \&. Since words readable in VP are highly non- 
generic in F(A) (because there exists a reduced word of length 2 in F{A) that cannot 
be read in we have \wi\ < |tu|/100 for i = 0, . . . , t. This implies that a definite 
positive proportion of to consists of the copies of z , which again contradicts 
genericity of w. 

This concludes the proof of Theorem 11.11 
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